Stay on Topic, Please: Aligning User Comments to the Content of a News Article
نویسندگان
چکیده
Social scientists have shown that up to $$50\%$$ of the comments posted a news article no relation its journalistic content. In this study we propose classification algorithm categorize user based on their alignment The seeks match an similarity content, entities in discussion, and topics. We BERTAC, BERT-based approach learns jointly article-comment embeddings infers relevance class comments. introduce ordinal loss penalizes difference between predicted true labels. conduct thorough show influence proposed learning process. results five representative outlets our can learn comment with $$36\%$$ average accuracy improvement comparing baselines, $$25\%$$ BA-BC. BA-BC is consists two models aimed capture dis-jointly formal language articles informal also evaluate human labeling performance understand difficulty task. agreement comment-article “moderate” per Krippendorff’s alpha score, which suggests task difficult.
منابع مشابه
the washback effect of discretepoint vs. integrative tests on the retention of content in knowledge tests
در این پایان نامه تاثیر دو نوع تست جزیی نگر و کلی نگر بر به یادسپاری محتوا ارزیابی شده که نتایج نشان دهندهکارایی تستهای کلی نگر بیشتر از سایر آزمونها است
15 صفحه اولsurveying the relevance of proportions to the content of quran verses
چکیده : قرآن چشمه سار زلال هدایتی است که از سوی خداوند حکیم نازل شده تا بشر را به سر منزل کمال برساند. و در این راستا از شیوه های گوناگون بیانی خطابی و بلاغی استفاده کرده تا با فطرت زیبا طلب انسان درآمیزد و اورا مقهور خویش ساخته، به سوی کمالات سوق دهد.ازجمله جنبه های بارز اعجاز بیانی قرآن وجود فواصل در پایان آیات است که کار برد سجع و قافیه در کلام بشر شبیه آن است. برخی ازعلمای سلف تفاوت هایی ب...
15 صفحه اولDiversifying User Comments on News Articles
In this paper we present an approach for diversifying user comments on news articles. In our proposed framework, we analyse user comments w.r.t. four different criteria in order to extract the respective diversification dimensions in the form of feature vectors. These criteria involve content similarity, sentiment expressed within comments, article’s named entities also found within comments an...
متن کاملMeasuring the Influence from User-Generated Content to News via Cross-dependence Topic Modeling
Online news has become increasingly prevalent as it helps the public access timely information conveniently. Meanwhile, the rapid proliferation of Web 2.0 applications has enabled the public to freely express opinions and comments over news (user-generated content, or UGC for short), making the current Web a highly interactive platform. Generally, a particular event often brings forth two corre...
متن کاملfrom linguistics to literature: a linguistic approach to the study of linguistic deviations in the turkish divan of shahriar
chapter i provides an overview of structural linguistics and touches upon the saussurean dichotomies with the final goal of exploring their relevance to the stylistic studies of literature. to provide evidence for the singificance of the study, chapter ii deals with the controversial issue of linguistics and literature, and presents opposing views which, at the same time, have been central to t...
15 صفحه اولذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2021
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-030-72113-8_1